LISTEN: A System for Locating and Tracking Individual Speakers

نویسندگان

  • Michel Collobert
  • Raphaël Féraud
  • G. Le Tourneur
  • Olivier Bernier
  • Jean-Emmanuel Viallet
  • Yannick Mahieux
  • Daniel Collobert
چکیده

Both visual and acoustical informations provide effec– tive means of telecommunication between persons. In this context, the face is the most important part of the person both visually and acoustically. We describe how the co– operation of image and audio processing allows to track a person’s face and to collect the audio information it pro– duces. We present detection techniques of regions of interest (e.g. moving regions of skin color), coupled with a neural network based face detector with a low false alarm rate, to locate and track faces. The system is connected to a nine microphone array adaptive beamforming which performs immediate beamforming. Visual and acoustical informa– tions from the speaker face are thus obtained in real time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Conditional Mixture of Neural Networks for Face Detection, Applied to Locating and Tracking an Individual Speaker

Abs t r ac t . We present a neural network approach to human face detection. Using a modular system, a conditional mixture of networks, we a r e able to detect front view faces as well as turned faces (up to 50 degrees) with excellent performances. This modular network is integrated into LISTEN, our face tracking system. It enables this system to detect and track in real-time faces in a variety...

متن کامل

Pedestrians Tracking in a Camera Network

With the increase of the number of cameras installed across a video surveillance network, the ability of security staffs to attentively scan all the video feeds actually decreases. Therefore, the need for an intelligent system that operates as a tracking system is vital for security personnel to do their jobs well. Tracking people as they move through a camera network with non-overlapping field...

متن کامل

Pedestrians Tracking in a Camera Network

With the increase of the number of cameras installed across a video surveillance network, the ability of security staffs to attentively scan all the video feeds actually decreases. Therefore, the need for an intelligent system that operates as a tracking system is vital for security personnel to do their jobs well. Tracking people as they move through a camera network with non-overlapping field...

متن کامل

Using a Novel Concept of Potential Pixel Energy for Object Tracking

Abstract   In this paper, we propose a new method for kernel based object tracking which tracks the complete non rigid object. Definition the union image blob and mapping it to a new representation which we named as potential pixels matrix are the main part of tracking algorithm. The union image blob is constructed by expanding the previous object region based on the histogram feature. The pote...

متن کامل

Speechreading Using Probabilistic Models Speechreading Using Probabilistic Models

A robust method for locating and tracking lips in gray level image sequences is described The method learns patterns of shape variability from a training set which constrains the model during image search to only deform in ways similar to the training examples Image search is guided by a learned gray level model which is used to describe the large appearance variability of lips Such variability...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996